Shape and scale in detecting disease clusters
نویسنده
چکیده
This dissertation offers a new cluster detection method. This method looks at the cluster detection problem from a new perspective. I change the question of “What do real clusters look like?” to the question of “What do spurious clusters look like?” and “How do spurious clusters affect the ability to recover real clusters?” Spurious clusters can be identified from their geographical characteristics. These are related to the spatial distribution of people at risk, the shape and scale of the geographic units used to aggregate these people, the shape and scale of the spatial configurations that the disease mapping or cluster detection method may impose on the data and the shape and scale of the area of increased risk. The statistical testing process may also create spurious clusters. I propose that the problem of spurious clusters can be resolved using a computational geographic approach. I argue that Monte Carlo simulations can be used to estimate the patterns of spurious clusters in any situation of interest given knowledge of the first three of these four determinants of spurious clusters. Then, given these determinants, where real measurements of disease or mortality are known, it is possible to show those areas of increased risk that are true clusters as opposed to those that are spurious clusters. This distinction is made in a three dimensional signature space, with shape, size and rate as the three axes. The extent of similarity (or dissimilarity) of a cluster to the simulated spurious cluster influences whether it can be recovered. These experiments show that this method is successful in detecting clusters. This method can also predict with reasonable certainty which clusters can be recovered, and which cannot. I compare this method with Rogerson’s Score statistic method. These comparisons expose the weaknesses of Rogerson’s method. Finally these two methods and the Spatial Scan Statistic are applied to searching for possible clusters of prostate cancer incidence in Iowa. The implications of the findings are discussed.
منابع مشابه
Speedy Object Detection based on Shape
This study is a part of design of an audio system for in-house object detection system for visually impaired, low vision personnel by birth or by an accident or due to old age. The input of the system will be scene and output as audio. Alert facility is provided based on severity levels of the objects (snake, broke glass etc) and also during difficulties. The study proposed techniques to provid...
متن کاملتحلیل زمان- مکان موارد سل ریوی در استان همدان با استفاده از خطر قابل انتساب جمعیت
Background and Objectives: One of the applications of population attributable risk percent (PAR%) is to estimate the disease burden in a population exposed to several risk factors. Therefore, this study was conducted to estimates the PAR% of the space-time clusters of pulmonary tuberculosis. Methods: In this study, the data of pulmonary TB cases were obtained from the health department of Ha...
متن کاملTemporal Changes of Effective Discharge for Suspended Sediments Transport in Korkorsar Watershed
Suspended sediment transport by rivers is important in water quality of cities and villages fed by certain drainage basins. On the other hand, the effective discharge refers to discharge that transports the most sediment over long time and controls the shape of river bed. Therefore, determination of effective discharge for suspended sediment transport and analysis the variation of effective dis...
متن کاملAn Optimization K-Modes Clustering Algorithm with Elephant Herding Optimization Algorithm for Crime Clustering
The detection and prevention of crime, in the past few decades, required several years of research and analysis. However, today, thanks to smart systems based on data mining techniques, it is possible to detect and prevent crime in a considerably less time. Classification and clustering-based smart techniques can classify and cluster the crime-related samples. The most important factor in the c...
متن کاملRefining membership degrees obtained from fuzzy C-means by re-fuzzification
Fuzzy C-mean (FCM) is the most well-known and widely-used fuzzy clustering algorithm. However, one of the weaknesses of the FCM is the way it assigns membership degrees to data which is based on the distance to the cluster centers. Unfortunately, the membership degrees are determined without considering the shape and density of the clusters. In this paper, we propose an algorithm which takes th...
متن کامل